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This letter considers several physical arguments about con- 
tributions to hydrophobic hydration of inert gases, constructs 
default models to test them within information theories, and 
gives information theory predictions using those default mod- 
els with moment information drawn from simulation of liquid 
water. Tested physical features include: packing or steric 
effects, the role of attractive forces that lower the solvent 
pressure, and the roughly tetrahedral coordination of water 
molecules in liquid water. Packing effects (hard sphere default 
model) and packing effects plus attractive forces (Lennard- 
Jones default model) are ineffective in improving the predic- 
tion of hydrophobic hydration free energies of inert gases over 
the previously used Gibbs and flat default models. However, 
a conceptually simple cluster Poisson model that incorporates 
tetrahedral coordination structure in the default model is one 
of the better performers for these predictions. These results 
provide a partial rationalization of the remarkable perfor- 
mance of the fiat default model with two moments in previous 
applications. The cluster Poisson default model thus will be 
the subject of further refinement. LA-UR-98-5431 

1. Introduction 

The idea of constructing an information theory de- 
scription of cavity formation in water |^ has reinvigo- 
rated the molecular theory of hydrophobic effects . 
One advantage of this approach is that simple physical 
hypotheses can be expressed in a default model. Given 
a fixed amount of specific information, the quality of the 
predictions gives an assessment of the physical ideas that 
are embodied in the underlying default model. Rele- 
vant physical ideas include: whether a direct descrip- 
tion of dense fluid packings significantly improves the 
predictions; or whether incorporation of de- wetting of 
hydrophobic surfaces is required; or whether specific ex- 
pression of the roughly tetrahedral coordination of water 
molecules in liquid water is the most helpful next step 
for these theories. It is remarkable that the previous 
successes of the information models for the primitive hy- 
drophobic effects have not required specific consideration 
of these physical points. 

This letter considers these physical arguments, con- 
structs default models to test them, and gives the re- 
sults of information theory predictions using those de- 
fault models with specific moment information drawn 
from simulation of liquid water. Occupancy moments are 
used as information. Complete moment information pro- 
duces results that are independent of the default model. 
However, the goal is to judge the default models and the 
physical ideas that they express. Therefore, our judge- 
ments will center on the accuracy of the predictions with 



limited moment information. More specifically, we take 
the view that the quality of the prediction with two mo- 
ments is critical because information for the first two oc- 
cupancy moments - the mean and variance - is available 
from experiment. 

Much of the technical work required to construct the 
default models considered involves molecular simulation 
calculations for fiducial systems. That technical work 
will be reported at a later time. 

The application of the information theory approach 
more broadly than to liquid water immediately turns-up 
cases where it works less well. Thus, a broader suite 
of default models will clearly be a key ingredient to the 
broader utility of this approach. 

2. Testing Physical Ideas of Hydrophobic Effects 

The information theory approach studied here grew 
out of earlier studies of formation of atomic sized cavities 
in molecular liquids |p|-|lT|. It has led to new and sim- 
ple views of entropy convergence in hydrophobic hydra- 
tion and of pressure denaturation of proteins ||l3| . A 
review of these developments has been given [ p^ ; broader 
discussions are also available Jl5| , p^ . 

The objective of the information theory prediction is 
the interaction part of the chemical potential of a hard 
core solute /3A/i = — Inpa , where pq the probability 
that the hard solute could be inserted into the system 
without overlap of van der Waals volume of the solvent; 
l//?=kBT. This procedure depends on a default model 
Pn of the distribution p„ of which po is the n = mem- 
ber. Two default models have been considered in previ- 
ous work: (a) the 'Gibbs default model' p„ oc 1/nl that 
predicts a Poisson distribution when the moment <n>Q 
is the only information available; and (b) the 'flat de- 
fault model' Pn = constant>0, n = 0, 1, . . ., Umax, and 
zero otherwise. The predictions obtained using these de- 
fault models for the hydration free energy of inert gases 
in water are similar. Convergence to the correct result 
is non-monotonic with increasing numbers of binomial 
moments Bj = (("))o used jl^jl^. Because of this non- 
monotonic convergence, the most accurate prediction ob- 
tained with a small number of moments utilizes only the 
two moments, Bi and B2. Furthermore, the flat default 
model produces a distinctly more accurate prediction of 
this hydration free energy when only two moments are 
used than does the Gibbs default model. 

The accuracy of the prediction utilizing the flat default 
model is remarkable. Furthermore, the Gibbs default 
model is conceptually more natural in this framework. 
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So, the effectiveness of tlie flat default model relative to 
the Gibbs model is additionally puzzling. The work that 
follows addresses these issues. 

It deserves emphasis that the overall distribution p„ is 
well described by the information theory with the first 
two moments, Bi and i?2- It is the prediction of the 
extreme member po that makes the differences in these 
default models significant. 

2.1. Packing 

A first idea is that the default model should contain a 
direct description of dense fluid packings that are central 
to the theory of liquids |17|. Accordingly, we computed 
Pn for the fluid of hard spheres of diameter d = 2.67 A 
at a density pd^ — 0.633. Those computations used spe- 
cialized importance sampling and will be reported later. 
Typical predictions for the hydrophobic hydration free 
energies of atomic size solutes obtained using those re- 
sults as a default model are shown in Fig. |l|. That shows 
the non-monotonic convergence with increasing number 
of occupancy moments obtained from the flat and the 
Gibbs default models. The predictions obtained using 
the hard sphere results as a default model are different 
but not improved in the essential aspects. Direct conver- 
gence is only seen if four or more moments are included. 
Though the convergence is more nearly monotonic from 
the beginning, the prediction obtained from a two mo- 
ment model is worse than for the flat and the Gibbs de- 
fault cases. 

2.2. Attractive Interactions among Solvent 
Molecules 

A next idea is that attractive forces between solvent 
molecules might play a significant role for these proper- 
ties because attractive forces lower the pressure of the 
solvent. Dehydration of hydrophobic surfaces becomes a 
principal consideration for solutes larger in size than the 
solvent molecules. But perhaps such effects are being felt 
already for atomic solutes. Accordingly, we computed Pn 
for the Lennard- Jones liquid studied by Pratt and Poho- 
rille ||l^ for which attractive interactions were adjusted 
so that the macroscopic pressure of the solvent would 
be approximately zero. This Lennard-Jones system thus 
gives a cavity formation free energy for atomic sized cav- 
ities that is about the same as that of common liquid 
water simulation models. The results of Fig. ^ confirm 
this latter point but also show that the convergence with 
number of moments is again non-monotonic and not bet- 
ter than for the flat and the Gibbs default models. Again, 
direct, non-monotonic convergence is only seen after four 
occupancy moments are included. 
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Figure 1. Convergence with number of binomial mo- 
ments of /3A^ predicted using several default models for 
a spherical solute with distance of closest approach A 
= 3.0 A for water oxygen atoms. Identifications are: 
diamonds (dash-dot lines), hard sphere default; crosses 
(short dash line), Lennard-Jones default; squares (long 
dash fine), Gibbs default; triangles (dotted line), cluster 
Poisson default; circles (gray line), flat default. For this 
value of A, binomial moments Bj are non-zero through 
j = 9. The horizontal line is the prediction with all 
nine moments included. With only two moments the 
Lennard-Jones default model makes the best prediction. 
However, the differences are slight with the exception of 
the hard sphere model. The results for the hard sphere 
default model were obtained from NPT Monte Carlo cal- 
culations at /3 p* = 2.989. The average density was 

= 0.633 ± 0.002 with d = 2.67 A. The Lennard- 
Jones model was obtained from NPT Monte Carlo calcu- 
lations at p=0.0 and T*=1.103. a = 2.67 A and the value 
tJkB—TIi K was obtained from a fit of zero pressure data 

The mean density was p* = 0.624±0.001. A system 
size of 256 particles was sufficient for both simulations. 



2.3. Tetrahedral 
Molecules 



Coordination of Solvent 



The final idea checked here is whether the predictions 
of cavity formation free energies are improved by incor- 
porating a tetrahedral coordination structure for water 
molecules in liquid water. We use a cluster Poisson model 
to accomplish this |0. The physical picture is: tetra- 
hedral clusters of water molecules with prescribed intra- 
cluster correlations but random positions and orienta- 
tions. 

A molecular cluster may contribute to occupants of 
a specific observation volume only if the center of the 
cluster is an occupant of a larger augmented volume; see 
Fig. H. Definition of this augmented volume will depend 
on the structures of the clusters and the choice of cluster 
center. We then consider the generating function p(z) 
for the probability pM that N cluster centers are present 
in the augmented volume: 
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p(z) = z^pn- (1) 

We assume that N is Poisson distributed p{z) = 
^~<N>(i~z) <_N> the product of the density of clus- 
ters and the volume of the augmented region. 




Figure 2. A tetrahedral cluster, the observation vol- 
ume (sphere with solid line, and the augmented volume 
(sphere with dashed line). The cluster may contribute 
occupants of the observation volume only if the center is 
within the augmented volume. 

Next we consider the generating function g{z) defined 
by the conditional probabilities, that a cluster with 
center in the augmented volume contributes n oxygen 
atom occupants to the observation volume: 

g(z) = (2) 

Defining the generating function for the probabilities of 
numbers of oxygen in the observation volume 

p{z) ^ (3) 

we can express 

p{z) = p{g{z)). (4) 

This is a standard result of probability theory [^ . 
Inp(z) is a polynomial function of z. Extraction of the 
series coefficients from Eq. js] provides the desired default 
model. The numerical effort resides only in the compu- 
tation of the gn- 

In this study, the clusters are assumed to be tetrahedra 
with the oxygen atom of a water molecule at the center 
and at each vertex. Thus we take <A^> = pv/5, with v 
the volume of the augmented region and p the molecular 



density of the solvent water. The 00 intra-cluster near- 
neighbor distance, the distance of a point of a tetrahedron 
from its center, is 2.67A and the augmented volume is a 
sphere with radius A -I- 2.67A. The coefficients of ^(z) are 
obtained from a Monte Carlo calculation that randomly 
positions a tetrahedron with center in the augmented vol- 
ume and counts how many 0-points of the cluster land 
in the observation volume. 

Fig. |l| shows the predictions for cavity formation free 
energy obtained with the cluster (tetrahedron) Poisson 
default model. The non-monotonic convergence is still 
evident. The prediction utilizing two moments is more 
accurate than that utilizing the Gibbs default model and 
similar to the predictions made by the flat default or the 
Lennard-Jones default in the best cases considered here 
for those models. 

3. Discussion 

Each of the default models newly considered here 
makes specific assumptions about n-body correlations. 
If the default model were the same as the experimental 
distribution, the limitation of the data to two moments 
would not be significant. The optimization would be un- 
affected by the number of experimental moments used. 

The present results suggests that the efficiency of the 
flat and Gibbs default models relative to the more so- 
phisticated hard sphere and Lennard-Jones default mod- 
els might be associated with the avoidance of specific 
assumptions for n-body correlations for the former cases. 
In this view, the specific assumptions for n-body corre- 
lations with the hard sphere and Lennard-Jones default 
models have to be displaced for a good description of 
cavity formation in liquid water. The third and fourth 
order factorial cumulants predicted on the basis of each 
of these default models using two experimental moments 
were evaluated and directly compared. In fact, the infor- 
mation theory predictions obtained for these moments 
were very similar to each other. 

A second point of discussion is that the biggest differ- 
ence between the Lennard-Jones and the cluster Poisson 
model is in simplicity. Though the differences in the pre- 
dictions seen here are not dramatic, the cluster Poisson 
model is simpler. This is particularly true for the de- 
pendence on thermodynamic state and the potential for 
further development. That the cluster Poisson model ex- 
pressing tetrahedral coordination appears to be a help- 
ful new direction is intuitive and encouraging. However, 
the fact that the predictions are not dramatically im- 
proved suggests that this sort of tetrahedral coordination 
is not the only or principal physical feature relevent for 
improved predictions cavity formation. 

The Lennard-Jones default model incorporates some 
of the dewetting phenomena that is expected to become 
more pronounced as the solute size increases Fig. ^ 
shows the variation of hydration free energy with solute 
size obtained with the different default models and two 
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moments. At the smallest solute size shown, all the mod- 
els give the same result. In the solute size range of 2.2- 
2.8 A, the cluster Poisson model gives the best results 
overall. For larger solute sizes, the cluster Poisson model 
results overestimate the hydration free energy. At this 
point, results from the Lennard- Jones default model cross 
the simulation results and become slightly too small for 
the larger solute sizes shown. 
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Figure 3. The variation of hydration free energy with 
solute size obtained with the different default models. 
The different models are identified as in Fig. ^ In ad- 
dition, simulation results are shown as open diamonds. 
At the smallest solute size shown, all the models give the 
same result. In the solute size range of 2.2-2.8 A, the 
cluster Poisson model gives the best results overall. For 
larger solute sizes, the cluster Poisson model results over- 
estimate the hydration free energy. At this point, results 
from the Lennard- Jones default model cross the simula- 
tion results and become slightly too small for the larger 
solute sizes shown. 

4. Conclusion 

We conclude that direct incorporation of dense fluid 
packing effects (hard sphere default model) and packing 
effects plus attractive forces that lower the pressure of the 



solvent (Lennard-Jones default model) are ineffective in 
improving the prediction of hydrophobic hydration free 
energies of inert gases over the previously used Gibbs and 
flat default models. However, a cluster Poisson model 
that incorporates tetrahedral coordination structure in 
the default model is intuitive, simple to implement, and is 
one of the better performers for these predictions. These 
results provide a partial rationalization of the remarkable 
performance of the flat default model with two moments 
in previous applications. The specific cluster Poisson de- 
fault model used here is primitive and will be the subject 
of further refinement. 
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